Appearance of Random Matrix Theory in deep learning

نویسندگان

چکیده

We investigate the local spectral statistics of loss surface Hessians artificial neural networks, where we discover excellent agreement with Gaussian Orthogonal Ensemble across several network architectures and datasets. These results shed new light on applicability Random Matrix Theory to modelling networks suggest a previously unrecognised role for it in study surfaces deep learning. Inspired by these observations, propose novel model true consistent our which allows Hessian densities rank degeneracy outliers, extensively observed practice, predicts growing independence gradients as function distance weight-space. further importance find, contrast previous work, that exponential hardness locating global minimum has practical consequences achieving state art performance.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nonlinear random matrix theory for deep learning Nonlinear random matrix theory for deep learning

Neural network configurations with random weights play an important role in the analysis of deep learning. They define the initial loss landscape and are closely related to kernel and random feature methods. Despite the fact that these networks are built out of random matrices, the vast and powerful machinery of random matrix theory has so far found limited success in studying them. A main obst...

متن کامل

Nonlinear random matrix theory for deep learning

Neural network configurations with random weights play an important role in the analysis of deep learning. They define the initial loss landscape and are closely related to kernel and random feature methods. Despite the fact that these networks are built out of random matrices, the vast and powerful machinery of random matrix theory has so far found limited success in studying them. A main obst...

متن کامل

Universality in Random Matrix Theory

which is the Central Limit Theorem. In principle, all the random variables X1, X2, · · · , XN can be of order 1, hence SN ∼ 1 as well, but the probability of having such a rare event is incredibly small. We can even estimate the bound on the probability for the rare event from the large deviation principle. A similar phenomenon happens when we form a large matrix from i.i.d. random variables an...

متن کامل

Developments in Random Matrix Theory

In this preface to the Journal of Physics A, Special Edition on Random Matrix Theory, we give a review of the main historical developments of random matrix theory. A short summary of the papers that appear in this special edition is also given.

متن کامل

Random Matrix Theory

Random matrix theory is usually taught as a sequence of several graduate courses; we have 16 lectures, so we will give a very brief introduction. Some relevant books for the course: • G. Anderson, A. Guionnet, O. Zeitouni. An introduction to random matrices. [1] • A. Guionnet. Large random matrices: lectures on macroscopic asymptotics. • M. L. Mehta. Random matrices. The study of random matrice...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Physica D: Nonlinear Phenomena

سال: 2022

ISSN: ['1872-8022', '0167-2789']

DOI: https://doi.org/10.1016/j.physa.2021.126742